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1. Introduction 

Accurate simulations of multi-jet final states are important for current experiments and will 
become even more so once the LHC starts. At the LHC the production rate for such states 
will be large due to the huge available phase space. Hadronic multi-jet states are used for 
many of the discovery channels for new physics and the main irreducible background comes 
from QCD. A good theoretical understanding and accurate physics simulations of multi-jet 
QCD states are therefore essential tools for understanding and analyzing LHC data. 

To be able to compare the predictions of a model with a collider experiment, a descrip- 
tion of final state hadrons is needed. There are a few phenomenological models available to 
describe the production of hadrons, but they all require that the perturbative emissions are 
well described, especially in the soft and collinear regions, to give reliable results. These 
collinear and soft emissions dominate the multi-parton cross section and can be taken into 
account to all orders, if one approximates the emissions to be strongly ordered, as is done 
in parton shower models. However, when the strong ordering no longer holds, which is the 
case if we have several hard and widely separated jets, the parton shower models become 
unreliable. 
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In order to improve the description of multi-jet states, full matrix elements can be used. 
These describe the process correctly up to a given order in the strong coupling constant. 
However, the matrix elements become difficult to calculate for high parton multiplicities or 
if one goes beyond tree-level. They also contain divergences in the soft and collinear limits 
and need to be regulated using cutoffs. 

The idea behind merging algorithms is to let the matrix elements describe the hard 
emissions and use the parton shower to describe the soft and collinear emissions. To 
accommodate this, the phase space needs to be split into two well defined regions, one 
where emissions are generated by tree- level matrix elements, and one where emissions are 
generated by the parton shower. To avoid double counting and dead regions, the two 
regions should have no overlaps and together cover the entire phase space. The scale that 
describes the border between the regions is called the merging scale. 

Some extra care is needed to avoid an artificial dependence on the merging scale. The 
matrix elements contain divergences in the soft and collinear regions, whereas the emission 
probability in these regions in the parton shower is finite since it is regulated by Sudakov 
form factors. The effects of the Sudakov form factors need to be included in the matrix 
element part of the phase space as well, making the state from the matrix element exclusive. 
Also a running coupling similar to that of the shower needs to be introduced. If this is done 
correctly the dependence on the merging scale should be minimal, however, especially for 
corrections with several extra jets, a small residual dependence on the merging scale from 
sub-leading terms is basically unavoidable. 

There are four main algorithms that address the problem of merging tree-level matrix 
elements and partons showers: CKKW [1], CKKW-L [2], Pseudo-Shower [3], and MLM [4]. 
CKKW was first published for e + e~ collisions [1] and later extended to hadron collisions 
[5]. Implementations have been made in SHERPA [6] and in HERWIG [3,7]. CKKW has 
been used in several studies of vector boson production in hadronic collisions [3,8-11]. 
CKKW-L was also first published for e + e~ [2] and later extended to hadrons and applied 
to VF-production [11,12]. All the CKKW-L results so far have been calculated using the 
ARIADNE [13] implementation. The Pseudo-Shower scheme was published in [3], where the 
algorithm was described, and an implementation based on PYTHIA [14] was applied to both 
e + e~ and hadron collisions. The MLM algorithm has been implemented in ALPGEN [15], 
MADEVENT [16] and HELAC [17]. There is also one implementation based on HERWIG and 
MADEVENT, which was used in [3]. Several studies using MLM have been performed for 
heavy quark and vector boson production with incoming hadrons [3, 11, 18, 19]. To our 
knowledge no calculation applying MLM to e + e~ annihilation has been published. 

Of all the implementations, only the SHERPA implementation of the CKKW scheme 
and the ALPGEN and MADEVENT implementations of MLM are publicly available, while 
the others are obtainable from their respective authours upon request. 

There have been some assessments of the systematics of the various algorithms done 
already, but the main focus so far has been the case of hadron collisions. Although collisions 
with incoming hadrons clearly are the most interesting in light of the upcoming LHC 
experiments, they also include a lot of complications, such as uncertainties from PDFs and 
BFKL-like corrections, that obfuscate the basic properties of the merging algorithms. e + e~ 
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annihilation is much simpler from a theoretical point of view and is therefore more suitable 
for testing the basic properties of the different algorithms. We believe it is essential to test 
the algorithms for e + e~ annihilation before moving on to hadron collisions. 

Systematics for e + e~ have been published for CKKW-L [2] and for CKKW and Pseudo- 
Shower [3], but the systematics for the Pseudo-Shower approach was quite limited. These 
studies are extended in this paper, where the four algorithms listed earlier are applied to 
different parton shower implementations. This allows us to thoroughly test which algo- 
rithms live up to their promises. 

To study systematics with as little complications as possible we only look at the simple 
case of e + e~ — > qqg. In this case the matrix element correction is already implemented in 
most of the parton showers, using a simple reweighting of the hardest splitting 1 [20-23], 
which makes this process particularly suitable for testing the various algorithms. Ideally 
all the calculations should show small deviations and yield more or less trivial results. 
However, we find that this is not the case. 

Since the process studied is rather simple, algorithms that perform well should also 
be tested with more complicated processes before they can be reliably used to predict 
experimental observables. Some of the complications that can occur during the merging 
do not enter if only first order corrections are applied. However, if an algorithm does not 
perform well for this simple process, it is improbable that it will work reliably for more 
complicated processes. 

We note that besides the four merging algorithms presented here, there are also other 
ways of combining matrix elements and parton showers, e.g. the methods based on modifyig 
the tree-level or NLO matrix elements to match the parton shower (see e.g. [24-32]). Such 
matching algorithms may also have a dependence on an artificial matching scale. Although 
it may be interesting to also benchmark these algorithms using the simplest e + e~ — > qgq 
process, we have not done so in this paper 2 . 

In this paper we will concentrate on the behavior of the merging schemes in absolute 
numbers. It would also be interesting to study their formal properties in terms of leading 
double- and single-logarithmic contributions to various cross sections, which so far has 
only been done for the CKKW scheme [1]. We plan to return to such issues in a future 
publication. 

In this article we start by reviewing the theoretical aspects of the four algorithms in 
section H Then we move on to showing results from our implementations of the algorithms 
in section [3| and finally in section || we present our conclusions. 

2. Theory 

All of the algorithms considered in this paper aim to do a good job of merging matrix 
elements and partons showers. The issues that are addressed are the same, namely to split 
the phase space in a clean well defined way and to make the matrix element event exclusive 

1 We refer to these matrix element corrections of only the hardest splitting as reweighting while the more 
general schemes on trial here are referred to as merging. 

2 After we wrote this paper a new algorithm was published [33,34] which used a similar benchmarking. 
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by introducing Sudakov form factors or using some other similar suppression. The main 
aim is to minimize any artificial dependence on the merging scale. 

2.1 General Scheme 

The basic steps are common to all the algorithms and can be summarized as follows: 

1. Select a process to be studied and choose a scheme to cutoff the divergences in 
the matrix elements, typically using a jet measure. Specify the maximum parton 
multiplicity to be generated by the matrix elements (This is currently limited to five 
or six extra partons for computational reasons). Calculate the cross section for all 
the parton multiplicities. 

2. Select a parton multiplicity with a probability proportional to its integrated cross 
section. Generate kinematics according to the matrix element. 

3. Calculate a weight for the event based on the Sudakov form factors and the running 
coupling. Use this weight either to reweight the event or as a probability for rejecting 
the event. If the event is rejected generate a new event according to step 2. 

4. Find a set of initial conditions for the parton shower and invoke the shower. This 
may include a veto on the emissions from the shower. 

All the algorithms considered in this paper do steps 1 and 2 in the same way, but 
steps 3 and 4 are done using rather different approaches. Each algorithm has its own way 
of including the Sudakov form factors and finding a good set of initial conditions for the 
shower. 

One of the key features that distinguishes the algorithms is the choice of scales. The 
algorithms uses different definitions of scales when determining how to split phase space 
between the matrix element and the parton shower and in calculating the Sudakov form 
factors. Furthermore, these scale definitions may be different from what determines the 
ordering of emissions in the parton showers. We show later in this paper that the scale 
choices have significant consequences for how well the dependence on the merging scale can 
be minimized. 

The rest of this section describes the details of each algorithm and their consequences 
for the physics result. The descriptions of the algorithms are limited to e + e~ collisions, 
but, with a few modifications and extensions, they have all been used to calculate results 
for hadron collisions. 

2.2 CKKW 

The theoretical foundation for CKKW was published in [1], but the main points are re- 
peated here for completeness. CKKW is focuses on the Durham fcj_-algorithm for clustering 
jets in e + e~ [35], where the distance between two partons, i and j, is defined as 

Vij = 2mm(Ef,E])(l - cos6 l3 ) / E 2 CM . (2.1) 

The /c_|_-algorithm is used to construct a parton shower history from the event generated 
according to the matrix element. The algorithm generates a set of clusterings and corre- 
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sponding scales, which are later used to calculate the Sudakov form factors and the running 
coupling. 

Before going through the CKKW algorithm some notation needs to be introduced. T q , 
T g and Tf are the branching probabilities for q — ► qg, g — > gg and g — > // respectively. 
The Sudakov form factors are then given by 

rQi 

A,(Qi,Q 2 ) = exp(- / dqT^Quq)) (2.2) 
JQ2 
rQi 

A s (Qi,Q 2 ) = exp(- / dg(r s (Q 1 ,g)+r / (Q 1 ,g))). (2.3) 

They can be interpreted as the probability of a parton with the production scale Q\ not to 
have a branching above the scale Q 2 . Note the dependency on the production scale in the 
branching probabilities. This means that the Sudakov form factors used in CKKW do not 
factorize (A(Qi, Q 2 ) ■ A(Q 2 , Q3) 7^ A(Qi, Q3)). This is also the case for the Sudakov form 
factors in the angular ordered shower, where the limits on the integration of the splitting 
functions are dependent on the production scale. 

The idea of CKKW is to use the full matrix element for the branching probabilities 
and analytical Sudakov form factors above the merging scale, and the parton shower below 
the merging scale. The full CKKW algorithm is the following: 



1. Calculate cross sections and generate events according to step 1 and 2 in section 2.1 
VMS denotes the merging scale which is equal to the matrix element cutoff, defined in 
terms of the /cj_-measure in eq. (|2.1|). The events are generated using a fixed strong 
coupling, a s ME, and a maximum multiplicity, N. 

2. Construct a shower history by applying the fcj_-algorithm to the state from the matrix 
element. The algorithm is constrained to only allow clusterings of partons which are 
consistent with a possible emission from the parton shower. This yields a set of 
clustering values y 2 , ...,y n , where y 2 = 1 > 2/3 > ... > y n and n denotes the parton 
multiplicity of the event from the matrix element. Use the result from the clustering 
to determine a set of nodes where the partons are merged and the associated scales, 

4 = Vi E CM- 

3. Calculate a weight for the running coupling given by nr=3 a s(Qi)/ a sME- 

4. For each internal line of type i running between a node with scale qj and q^ apply 
a factor Ai(qj,qMs)/&i(qk,ctMs), where q^ s = y M s^cM- For external lines of type i 
starting from a scale qj apply the weight Ai(qj, (?ms)- These weights are the Sudakov 
form factors and they are calculated analytically. 

5. Reweight the event with the product of the Sudakov form factors in step 4 and the 
running coupling weight in step 3. 

6. Set the starting scale of each parton to the scale associated with the node in the 
shower history where it was produced. Invoke the shower and veto any emission 
which would give a /cj_-measure above ^ms- 
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The original CKKW procedure [1] contained no special treatment for highest multi- 
plicity events. This needs to be included since applying the veto on emissions in the shower 
down to the merging scale would prevent events with more than N jets above the merging 
scale from being generated. One way to resolve this is to modify the procedure for highest 
multiplicity events (n = N) and use the scale q n instead of c/ms in the Sudakov form factors 
and the vetoed shower. This is done in [3,36]. 

Another issue that needs to be addressed is effects related to the choice of ordering 
variable in the shower. The entire theoretical derivation in the CKKW publication [1] 
uses only one way of defining scales, namely the Durham k±. While the discrepancy from 
using a different ordering variable in the shower may cancel to some accuracy, this is not 
explicitly shown or discussed, even though the shower used in the publication is ordered in 
virtuality. The consequences of applying the scheme to a shower not ordered in Durham 
k± are therfore somewhat unclear. 

The original CKKW publication [1] claims that this procedure cancels the dependence 
on the merging scale to next-to-leading logarithmic (NLL) accuracy. However, this claim 
assumes that the formalism used, including Sudakov form factors, jet rates and generating 
functions, is valid at NLL, but this proof has never been published. 3 

2.3 CKKW-L 

The CKKW-L algorithm goes through the same basic steps as CKKW, but has a different 
way of calculating the Sudakov form factors and implementing the veto in the shower. 
In the CKKW-L scheme, a full cascade history with intermediate states is constructed. 
What is done is basically to run the cascade backwards and answer the question "how 
could the shower have generated this state?". This means that the ordering scale in the 
shower is used when clustering the partons. As in the CKKW case, only physically allowed 
clusterings are considered. However, contrary to CKKW where always the smallest scale 
is chosen for each clustering, all possible ordered shower histories are considered, and one 
is chosen according to a probability proportional to the product of the relevant branching 
probabilities. The chosen shower history is then used to calculate the Sudakov form factors 
and the running coupling. 

To be able to use this scheme, the parton shower needs to have well defined interme- 
diate states and it is also required that the Sudakov form factors factorize (A(Qi,Q2) ■ 
A(Q2,Qs) = A(Qi,Qz)). This is achieved if the Sudakov form factors only depend on 
the kinematics of the intermediate state rather than the production scale. This is for ex- 
ample the case in the dipole shower used in ARIADNE [13] and the p^-ordered shower in 
PYTHIA [37], but not for the angular ordered shower in HERWIG [7]. 

The effects of the running coupling are taken into account by reweighting with the 
same a s as in the shower with the constructed scales as input. The Sudakov form factors 
are introduced by using the shower to generate single emissions from the constructed states 
starting from the constructed scales and rejecting the event if the emission is above the 

3 The only reference leads to reference 22 in [35], which is marked "in preparation", and it has been 
confirmed by one of the authors of the article that it was never completed. 
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next constructed scale. This is known as the Sudakov veto algorithm and it is equivalent 
to accepting events with a probability equal to the Sudakov form factor, since by definition 
the no emission probability of the shower is equal to the Sudakov form factor. 

There are several advantages to this approach. One is that it makes sure that the 
Sudakov form factors above and below the merging scale match exactly and another is 
that any corrections introduced in the shower is also included in the Sudakov form factor. 
This is particularly useful if the splitting functions have been reweighted with matrix 
elements, since this makes it possible to completely cancel the merging scale dependence 
for first order correction (shown explicitly below). We also expect that the cancellation of 
the first order correction will lead to a smaller merging scale dependence for higher order 
processes even though the complete cancellation no longer holds. 

We use a slightly different notation in this section to emphasize the difference in the 
Sudakov form factors with respect to the ones in CKKW. As„(pi, P2) denotes a Sudakov 
form factor for an n-particle state giving the probability that there is no emission with 
a shower scale, p, between p\ and p2- The merging scale uses another notation, Qms, to 
emphasize that this scale does not need to be defined in terms of the emission scale in the 
shower. In fact, one could in principle use any partonic scale definition for the merging 
scale. These are the steps in the CKKW-L algorithm. 



1. Calculate cross sections and generate events according to step 1 and 2 in section 2.1 



Qms denotes the merging scale which is equal to the matrix element cutoff and may 
be defined using any choice of scale. The events are generated using a fixed strong 
coupling, a s MEi and a maximum parton multiplicity, N. 

2. Construct a full cascade history by considering all possible ordered histories and 
selecting one randomly with a probability proportional to the product of the branch- 
ing probabilities. If no ordered histories can be constructed, unordered ones are 
considered. This results in a set of intermediate states (S2, S3, ...S n ) and scales 
(P2 = Pmax, P3i Pn)- S2 denotes here denotes the constructed 2 — > 2 process and 
S n is the state given by the matrix element, n is the parton multiplicity in the event, 
Pmax is the maximum scale of the process and pi is the constructed scale where the 
state Si-i emits a parton to produce the state Si. 

3. Reweight the events with nr=3 a s(Pi)/ a sME- 

4. For each state Si (except S n ), generate an emission with pi as starting scale and if 
this emission occurred at a scale larger than pi+\ reject the event. This is equivalent 
to reweighting with a factor n£=2 ^-sXPi^ Pi+i)- 

5. For the last step there are two cases. 

• If the event does not have the highest multiplicity n < N, generate an emission 
from the state S n with p n as starting scale. If the emission is above the merg- 
ing scale QmSj reject the event. Otherwise accept the event and continue the 
cascade. 

• If the event has the highest possible multiplicity n = N , accept the event and 
start the cascade from the state S n with the scale p n . 
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The algorithm introduces all the factors that would have been present if the event had 
been generated by the parton shower, except the branching probability which is taken from 
the matrix element. To show how this comes about, a derivation of the parton multiplicity 
cross sections for a first order matrix element is shown below. The explicit reweighting of 
a s is not shown, but is straight forward to include. 

Let po denote the parton shower cutoff and Ts n (p) denote the probability that a 
state S n branches at scale p. The definition of the Sudakov from factor in this case is 
As n (pi, P2) = exp(— dpT s n (p))- The exclusive parton multiplicity cross sections gen- 
erated by the standard parton shower can be written as 

0-2 S (Pmax,Po) = CT A 5a (p max , p ) (2.4) 

/"Pmax 

0-3 S (pmax,Po) = °o / dp A s . 2 (p max , p) V s 2 (p) A S:i (p, p ) (2.5) 
J Po 

rPma,x 

0-£ S (Pmax,Po) = CT / dp A S2 (p m ax, p) ^S 2 (p) X 

J Po 

x [ P dp'As 3 (p,p')r S3 (p')As 4 (p',p ). (2.6) 
J po 



This notation can be generalized to include all higher multiplicity cross sections. Let 
/ 3_m (p, Po) denote the probability of the three-parton state to evolve into an n parton 
state between the two given scales. This means that a general n > 3 parton cross section 
can be written as 

/"pmax 

a™(p max ,po) = a / dpA 52 (p max ,p)r 52 (p)/ 3 ^ n (p,po), (2.7) 

J on 



where 



f- 3 (p, Po ) = A Ss (p, Po ). (2.8) 
/ 3 - 4 (p,p )= rdp'A S3 (p,p')Ts,{p')A Si {p\ Po ) (2.9) 

J on 



The merging scale may be defined using a different way of mapping the phase space 
(denoted Q) as compared to the scale used in the shower. The value of the scale used to 
define the merging scale and the scale of the shower can be related if the other variables 
that determine the shower emission is included. Let x represent the k additional variables 
used in the shower, which may include energy fractions and rotation angles. The branching 
probability can be written in a way that it includes the dependence on these variables as 
long as the following is true. 

T(p) = J d k xT(p,x) (2.10) 

The alternative mapping of phase space is described by the function Qs n (p, x), which 
denotes the value of the merging scale measure for a given shower emission. The lowest 
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order matrix element cross sections are equal to 

a 2 E (Pmax> Qms) = O"0 (2-H) 

cjf E (p m , x ,Qus) = vo dp d k xQ(Q S2 (p,x) - Q MS )TfV(p,x). (2.12) 

J po J 

Tc? E is the branching probability in the matrix element and is the standard Heaviside 
function. Note that the equations above only hold if the matrix element merging scale is 
above the shower cutoff everywhere in phase space, but this can be resolved by discarding 
all events from the matrix element which are below the shower cutoff. 

The next step is to apply the algorithm to calculate the jet rates at po for the merged 
matrix element and parton shower. The two-jet state is already the lowest order process, 
which means no cascade history is constructed and only the final Sudakov veto down to 
the merging scale enters. The parton multiplicity cross sections become equal to that of 
the pure shower minus the cross section for events with the first emission above Qms- The 
two-jet matrix-element contributions to the cross sections are equal to 



0-2(Pmax,/9o) = °0 ^S 2 (/»max, Po) ( 2 -13) 
0"nGOmax, Po) = dp d k X 0(QmS ~ Qs 2 (Pi ^)) ^S 2 (Pmax, P) X 



pa 



xT S2 (p,x)f^ n (p, P0 ). (2.14) 

When the algorithm is applied to a three-jet event, an emission scale, p3, is constructed. 
Emissions are generated from the two-jet state and events discarded according to the 
Sudakov veto algorithm, which is equivalent to introducing a weight As 2 (/3 max , p^). The 
cascade is then started from the scale pj, and, assuming the three-jet configuration is the 
highest multiplicity, no additional Sudakov suppression is included. Using the matrix 
element cross section from equation ( 2.12 ) and, assuming that the constructed scale ps is 
equal to the scale used in the matrix element, results in the following contributions to the 
parton multiplicity cross sections. 

0"n(Pmax,Po) = oj^iPmax, Qms) &S 2 (Pmax, Pz) f~™{PZ, Po) = 

C Pmax f 

/„ zt\ r\ \rME 



a dp d«xQ(Q S2 (p, x) - Qms) T%?(p, x) x 

J Po J 

xA S2 (pm^,p)f^ n (p,Po) (2.15) 



The following cross sections are the result from adding the contributions from the two- 
and three-jet processes. 

^ E+PS (Pmax,/O ) = ff0As 2 (Pmax,P0) (2.16) 

/"Pmax f 

UE+PS (p max ,Po) = <7o / dp d k x[rf*(p,x) Q(Qs 2 (p,x) - Qms) + 

J po J 

+T p s l(p,x)@(Q M s-Qs 2 (p,x))}^s 2 (Pn iax ,p)f^ n (p,Po) (2.17) 



From the equations above one can see that the only dependence on the merging scale 
Qms is i n the integration over the branching probabilities. This means that the merging 
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scale dependence for this process cancels to the accuracy of which the shower generates the 
first emission. In fact, for the process studied in this paper, many of the parton shower im- 
plementations reweight the splitting function for the first emission with the matrix element, 
in which case the merging scale dependence cancels completely. 

2.4 Pseudo-Shower 

The Pseudo-Shower algorithm is similar in spirit to CKKW-L, but it uses partonic jet 
observables for the kinematics instead of individual partons. Where CKKW-L runs the 
cascade one emission at a time from a given parton state to calculate no-emission proba- 
bilities, in the Pseudo-Shower approach a full parton cascade is evolved, and the resulting 
partons are clustered back to jets. Any standard clustering algorithm, where in each step 
the pair of particles which are closest together are clustered, can be used. The same dis- 
tance measure is used to define the merging scale as the one used to construct a shower 
history for the matrix-element state and the fully showered states. The full algorithm is 
defined as follows. 

1. Choose a jet clustering scheme to be used in the algorithm and a merging scale, dyis- 
Set the matrix element cutoff equal to the merging scale, calculate cross sections and 



generate events according to step 1 and 2 in section \2.1[ The events are generated 
using a fixed strong coupling, o s me, and a maximum multiplicity, N. 

Cluster the partons from the matrix element, using the selected jet scheme, until a 
2 — > 2 state is reached. As in CKKW, only clusterings corresponding to physically al- 
lowed splittings are considered. The result of the clustering is interpreted as a shower 
history with a set of states (S%, . . . S n ) and a set of scales (cfe = d max , cfe, . . . , d n ), 
where n is the parton multiplicity of the event. 

For each state Si, except S n , perform a full shower vetoing any emission with a jet 
measure greater than the corresponding scale in the shower history, d > di. Calculate 
a set of clustering scales, dj, by clustering the partons from the shower using the same 
algorithm as in step but for practical reasons also allow clusterings corresponding 



to non-physical splittings. Reject the event if ydi+i > y dj+i + 5 (the 5 is a fudge 
factor to be discussed below). 

4. For the final state S n the shower is invoked, vetoing emissions with d > d n . If 
the event is not a maximum multiplicity event (n < N), the partons are clustered 
and the event is rejected if the scale from the clustering is above the merging scale, 
y/d n +i > y/dyis + 8. For N = n the event is accepted except if \J d n+ \ > \f~dn + 5. 

Although it was not stated in the text in [3] , the implementation did include a reweight- 
ing with a running strong coupling Yli=3 a s(d~i) / ct^^. The same reweighting is also in- 
cluded here. 

To see the similarity with CKKW-L, consider using the Pseudo-Shower using a parton 
shower with an ordering variable equal to the distance scale in the clustering algorithm 
used, and with well-defined intermediate states. In the strongly ordered limit, the clustering 
algorithm would then exactly reproduce the intermediate states and branching scales in 
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the shower. This means that evolving a full shower from the state Si, starting from a scale 
di, clustering to find a di + \ and rejecting the event if di + \ > dj+i would exactly correspond 
to the Sudakov form factor Ag^dj, dj+i), and the merging scale dependence for first order 
matrix element corrections would cancel in the same way as CKKW-L. 

In reality, the clustering does not exactly reconstruct the shower splittings. If subse- 
quent emissions are not clustered in the same way as the shower emitted them, this can 
affect the clustering scale of harder emissions. This means that the scale of the matrix 
element partons before the shower and the scale that one gets from the jet clustering after 
the shower are rarely the same. This is a significant problem, since the phase space cuts 
that separate the matrix element emissions from the partons shower emission is done using 
two different scales, which leads to dead regions and double counting of emissions and it 
also affects the calculation of the Sudakov form factors. 

To moderate the effects of having two different scales, the fudge factor 5 (introduced 
in [3] ) was included whenever comparing a scale from the clustering of the matrix element 
state with a scale from the clustering of the fully showered state. In [3] the value 5 = 2 GeV 
was used without motivation, theoretical or otherwise, but supposedly the parameter needs 
to be tuned for each choice of process and merging scale to properly compensate for the 
mismatch in scales. 

2.5 MLM 

The MLM algorithm is similar to the Pseudo-Shower in that it also does matching with 
partonic jet observables. The algorithm is much simpler to implement compared to earlier 
schemes discussed. In the MLM merging scheme the event from the matrix element is 
simply fed into the parton shower program, the shower is invoked and the final state partons 
are clustered into jets. The algorithm then specifies that the matrix element partons should 
be matched to the final state partonic jets, and events are accepted only if all the jets match 
and the event contains no extra jets above the merging scale. In this way the Sudakov form 
factors are approximated by the probability that there are no emissions above the merging 
scale and, at the same time, the parton shower emissions are approximately constrained 
to be below the merging scale. The MLM algorithm is a really convenient way of doing 
merging since it requires no modifications to the parton shower program. 

Even though MLM has been frequently used, a general version of the algorithm has 
never been published and all the published algorithms assume incoming hadrons. Based 
on [4,11,19], we present here our interpretation of the necessary steps needed for applying 
the MLM scheme to e + e~ collisions. 

The first step in an MLM implementation is to choose a jet definition to be used for 
the merging scale and the matrix element cutoff. The original MLM algorithm used cone 
jet definitions, although there have been implementations using the /cj_-algorithm, e.g. the 
MAD EVENT implementation in [11] and the HERWIG implementation in [3]. After specifying 
a cutoff, events are generated and the running coupling reweighting is calculated in the 
same way as in the CKKW algorithm. 

Then the shower is invoked using an appropriate starting scale, which is defined for 
each implementations and process. For VF-production in hadron collisions, the scale is 
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set to the transverse mass of the W (y m^y + P±w) [H]> whereas for the top production 
implementation it is not specified [19]. In hadron collision there is some freedom for 
choosing the starting scale of the shower, but for e + e~ we think that the scale should be 
set to center of mass energy, to allow the shower to utilize the full phase space. 

After the shower has been invoked the final state partons need to be clustered into 
jets and matched to the partons from the matrix element. The clustering is done with the 
same algorithms used to define the merging scale. The partons from the matrix element 
are then matched to the clustered jets, in order of decreasing energy. The measure used to 
match the partons to jets is some quantity related to the jet clustering. These are all the 
steps in the algorithm. 

1. Select a merging scale, Qms> and a matrix element cutoff Q cu t, such that Q C ut < Qms, 
where the scales are defined using a jet algorithm. Calculate cross sections and 



generate events according to step 1 and 2 in section 2.1. The events are generated 
using a fixed strong coupling, « s me, and a maximum parton multiplicity, N. 

2. Cluster the partons from the matrix element using the /cj_-algorithm and use the 
clustering scales as in input to a s and reweight the event. 

3. Feed the event into a parton shower using the Les Houches interface [38], setting the 
scale to Eqm, and start the shower. 

4. Cluster the partons to jets using the algorithm from step 1 with a clustering scale 
set to Qms- Go through the list of partons, in order of decreasing energy, and match 
them to the clustered jets. This is done by finding the jet with the smallest distance 
to the parton defined using some measure based on the jet clustering scheme 4 . If not 
all the partons match or there are extra jets, reject the event. 

For the highest multiplicity events either use a higher clustering scale and more 
relaxed matching criteria or allow extra jets that are softer than the matched jets. 

There are several aspects of this algorithm that needs further explaining. The reason 
for having a cutoff below the merging scale is that events slightly below the merging scale 
can end up above after the shower. This leaves an arbitrary choice of matrix element 
cutoff, but this can be resolved if soft and collinear particles have a vanishing probability 
to generate an independent jet, which means that the result converge when the matrix 
element cutoff is lowered. This is a way of getting around the problem that occurred in 
the Pseudo-Shower algorithm, namely that the cuts on the matrix element state and on 
the partonic jets are not equivalent. However, the convergence needs to be verified for each 
implementation and process. 

One other aspect that needs further scrutiny is what happens inside the parton shower 
program. The main danger is that the program may be given a state with one or more 
relatively soft parton and a rather high starting scale, which means that the shower often 
ends up emitting harder partons than the ones already present, leading to an unordered 
shower. This breaks the strong ordering approximation, which is fundamental to all parton 



4 This cannot be exactly the same distance measure as in the jet algorithm for reasons to be discussed 
in section H 
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showers, and the end result is heavily dependent on how unordered emissions are handled. 

To derive some of the properties of MLM, let us assume (as we did in the Pseudo- 
Shower case) that the jet clustering is a perfect inverse of the shower and that the shower 
has well defined intermediate states. These assumptions are a bit crude considering the way 
MLM is used in current implementations, but it allows for the possibility to do analytical 
calculations and it should give some idea of what to expect from the algorithm. Under these 
assumptions parton multiplicity cross sections, including the first order matrix element 
corrections, can be calculated. 



The calculations are performed using the same notation as in the section |2.3| . The 
merging scale can be defined in terms of the scale in the shower (pms) an d starting the 
scale from the center of mass energy is equivalent to using the maximum scale (/0 max ). The 
two-jet matrix element contribution to the parton multiplicity cross sections becomes the 
same as in CKKW-L. 

0-2(Pmax,Po) = 0"O " A S2 (p ma , x , Po) ( 2 -18) 
rpwis 

maxi Po) — <^0 ' 

/ dpA S2 ( Pm ^p)Ts 2 (p)f- in (p,p ) (2.19) 

The contribution from the three-jet is different however. The reason is that there is no Su- 
dakov form factor from a two-particle state included since no shower history was considered. 
The contribution to the cross sections is the following. 

/"Pmax 

ff„0w,po) = <ro- dpTf 2 E (p)A S3 (p max ,p)f- n (p,p ) (2.20) 

J Pms 

The sum of the two contributions become the following. 



0-2(/Omax,Po) = " A S2 (pmax, Po) ( 2 - 21 ) 

rpmax 

O"n(/Omax, Po) = O"0 " / dp [Tf^(p) As 3 (p max , p)Q(p - p M s) + 

J PI) 

+T v s s 2 {p) A S2 (p max ,p)e(pus ~ P)\ f^ n (p,Po) (2.22) 



Comparing equation ( 2.21| ) to ( [2.4| ) one can see that the two-parton cross section becomes 



the correct one. Note that this would not have be the case if a lower starting scale was 
chosen for the shower. The higher multiplicity cross sections contain complications, which 
can be seen by comparing equation ( 2.22j ) to ( |2.7| ). The problem is that MLM does not 
include the Sudakov form factor from the S2 state, which means that there will be an 
additional dependence on the merging scale as a result of the difference in the Sudakov 
form factors. The factor As 3 (p m nx, p) is where the explicitly unordered shower occurs and 
the results are therefore largely dependent on the parton shower implementation. 

Consider, for illustration, using the /cj_-ordered shower of PYTHIA on a three-parton 
state. Here the maximum transverse momentum of an emission is given by half the largest 
of the qg and gq invariant masses, which for a soft gluon can be very small. Hence, 
the Sudakov form factor between Eqm and this transverse momentum would be absent, 
resulting in a large dependence on the merging scale. 
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The actual MLM implementations contain several other complications. The jet clus- 
tering used is not the inverse of the shower and most implementations use parton showers 
that do not have well defined intermediate states. However, none of these aspects can 
resolve the problem that the Sudakov form factor is generated using a three-particle state 
instead of a two-particle state. The error caused by using the wrong Sudakov form factor 
is inherent in any MLM implementation. 



3. Results 

Each of the algorithms described above have been tested for the first order matrix element 
correction to e + e~ — > qq at the Z° pole. As explained in the introduction, this matrix 
element correction can also be included by a simple reweighting of the first (or hardest) 
splitting in a parton cascade, thus providing us with the "correct" answer for comparison. 
In this way we can check whether the merging algorithms actually meet their goals of a 
clean cut between matrix element and parton shower phase space and a small dependence 
on the merging scale. Only if they do achieve these goals on this simple case, can we believe 
that they are likely to achieve their goals when generalized to higher order matrix elements 
and more complicated processes. 

To have a fair comparison, we have in all cases generated matrix element events using 
a cut in the Durham /cj_-algorithm distance measure, y, as defined in eq. ( |2.1| ). Also the 
merging scale is defined using this distance measure. The only exception is for the Pseudo- 



Shower algorithm, where a slightly different scale is used as explained below in section ^3 
The matrix element events were generated with MADEVENT (v 4.1.31) [16]. 

To check the merging scale dependence we look at the distribution which should be 
the most sensitive, namely the y% scale where the &j_-algorithm clusters three jets into 
two, when applied to the final parton-level events. We also look at two hadron-level event 
shape observables, which have been well measured at LEP and corrected to hadron level. 
One is the normalized y% distribution of all final state particles measured by ALEPH [39], 
which shows how the dependence on the merging scale is reflected in the hadronic final state. 
The other is the normalized charged particle thrust distribution measured by DELPHI [40] , 
which is not directly related to the merging scale, but is nevertheless very sensitive to the 
leading order matrix element correction. 

In figures |l] and |2] we present these distributions for the four generators ARIADNE (v 
4.12) [13], HERWIG (v 6.510) [7] and PYTHIA (v 6.413) [37], which all are equipped with 
simple matrix element reweighting. We see that they all agree fairly well, which should 
come as no surprise since they have all been tuned to fit LEP data. 

In figures and | we also show the results from PYTHIA, with the matrix element 
reweighting switched off to give a sense of how large an effects we should expect from the 
matrix element corrections. We see that the main effect is that these distributions are 
clearly harder than the ones with matrix element reweighting. 

We had planed to also include SHERPA [6] in this comparison and use it for the CKKW 
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Figure 1: The j/3 spectra at parton level for PYTHIA, PYTHIA with matrix element reweighting 
switched off, ARIADNE and HERWIG. 
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Figure 2: The 2/3 spectra for charged and neutral particles (a) and the charged particle thrust 
spectra (b) for PYTHIA, PYTHIA with matrix element reweighting switched off, ARIADNE and HERWIG 
compared to ALEPH and DELPHI data. The in-sets at the bottom of the plots show the relative 
differences between the Monte Carlo results and data, (<tmc — ODataVcData- 

results. Unfortunately we discovered inconsistencies 5 in the results and therefore decided 
not to use SHERPA at all in this study. 

3.1 CKKW-L 

We start by looking at the results from the CKKW-L scheme. Two implementations have 
been considered. We have used the original implementation [2] in ARIADNE and we have 

5 Both the version 1.0.10 and 1.0.11 of SHERPA give different results for the cross section depending on 
whether weighted or unweighted events are used. Changing between weighted and unweighted events also 
gives two different results for the shape observables in version 1.0.10, neither being consistent with the 
results from version 1.0.11. 
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also made an implementation of the first order corrections using the transverse momentum 
ordered shower in PYTHIA [37,41]. 

In ARIADNE the parton evolution is modeled by a dipole cascade [20,42]. Unlike 
most other parton showers, the dipole cascade is based around 2 — ► 3 partonic splittings 
rather than 1 — ► 2. This model automatically includes the coherence effects from emitting 
gluons from colour-neighboring partons, which means that explicit angular ordering is not 
necessary. It also means that the first order matrix element e + e~ — > qgq is already present 
in the cascade by construction. The ordering variable used in the cascade is a Lorentz- 
invariant transverse momentum measure 

P± = , (3-1) 

S123 

where s^,- are the invariant masses of the partons and index 2 indicates the emitted parton. 
This measure is then also used in the construction of the shower history in a procedure 
similar to the DlCLUS jet clustering algorithm [43], but including only physically allowed 
clusterings. 6 

We have also implemented the CKKW-L scheme using the transverse-momentum or- 
dered shower [41] in PYTHIA. 7 This cascade exhibits the main features required by the 
CKKW-L scheme, namely that it is ordered in transverse momentum and that is has well- 
defined on-shell intermediate states. The ordering variable (also called p±) is defined in 
the following way: 

p 2 ± = z(l-z)Q 2 (3.2) 

Q 2 is the invariant mass of the produced parton and radiating parton and z is the energy 
fraction of the radiating parton. The ordering of the shower is done in a way which 
incorporates coherence effects without requiring explicit angular ordering. 

There is one significant difference in the PYTHIA implementation, namely that the 
constructed history is no longer unique. In PYTHIA there are two possible ways for the 
gluon to be emitted (from the quark or the anti-quark), which means that there are two 
possible histories to be considered. One history is selected with a probability proportional 
to its branching probability. 

In figure |3] we show the parton-level distribution for ARIADNE and transverse mo- 
mentum ordered PYTHIA shower with matrix element reweighting. 8 Both programs are 
compared to their respective CKKW-L implementations with different merging scales, and 
the figure shows that the cancellation is almost complete. There are some small discrep- 
ancy for the lowest cutoff of the order of 2%. This is because MADEVENT generated the 
matrix element with massless u, d, s and c quarks, where as in ARIADNE and PYTHIA they 



6 For higher multiplicities, all possible shower histories are considered. However, for this simple case 
there is only one unique history 

7 Note that this is not a complete implementation, since only the leading order matrix element correction 
to e + e~ —* qq is considered. 

s We have used PYTHIA v 6.413, amended with a fix approved by the authors to avoid a bug in the 
built-in matrix element reweighting. This bugfix has been included v 6.414. 
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Figure 3: (a) The yz spectra at parton level for ARIADNE (the first order matrix element is 
included by construction) and for the ARIADNE implementation of CKKW-L corrections with dif- 
ferent merging scales, (b) is the same for our PYTHIA implementation of CKKW-L corrections 
with different merging scales. The in-sets at the bottom of the plots show the relative differences 
between the CKKW-L results and the default shower, (ockkw-l — cshowor)/cshower- 



are given a small mass, which causes a slight deviation in the emission probability. This 
discrepancy can be removed by setting the masses in ARIADNE and PYTHIA equal to the 
masses in MADEVENT. No deviations, however, are visible below the merging scale. 

For completeness we show in figure |] the comparison for the hadron-level observables 
?/3 and thrust for ARIADNE and p^-ordered PYTHIA respectively. As expected from the 
parton-level results, there is no serious dependence on the merging scale. 9 



3.2 CKKW 

The CKKW algorithm has been implemented using PYTHIA, with both the pj_-ordered 
and the virtuality ordered shower. The implementations are done according to the scheme 



described in section 2J2. The only difference is that none of the implementations use the 
Durham k± as ordering variables, therefore setting the starting scale is done differently. 
The scale of the quark and anti-quark is set to Ecm an d the scale of the gluon is set to the 
p± of the reconstructed splitting for the pj_-ordered shower and the virtuality and angle 10 
of the splitting for the virtuality ordered shower. For the latter, the scheme is essentially 
equivalent to what is implemented in SHERPA. The results are shown in figure ||. 

The results from the p± ordered shower show a smooth transition between the regions 
above and below the cutoff. This is to be expected since Durham k± is approximately 
equal to the p± in the shower, which means that the entire procedure becomes similar 
to CKKW-L with the exception that the Sudakov form factors are calculated according 



9 The PYTHIA px-ordered shower has not been properly tuned to LEP data, and we therefore do not 
compare it directly to data. However, comparing with figures ^ja and b, it is clear that the variations due 
to the merging scale are well within the experimental errors. 

10 Besides having the virtuality as ordering variable, PYTHIA also imposes a veto on emission angles to 
ensure angular ordering. 
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Figure 4: Charged plus neutral particle j/3 and charged particle thrust for the ARIADNE and 
PYTHIA implementation of CKKW-L corrections with different merging scales. Figure (a) includes 
a comparison to ALEPH data and (b) a comparison to DELPHI data, where figure (c) and (d) only 
compare to the default PYTHIA, since the pj_-ordered shower has not yet been tuned to data. 

to an analytical approximation. The CKKW results show a slightly higher cross section 
than standard PYTHIA, which is attributed to the fact that in the analytical Sudakov form 
factors the approximate splitting functions are integrated over parts of phase space where 
they are negative. This means that Sudakov form factors cannot be interpreted as no- 
emission probabilities in the same way as in the shower and the end result is a slightly 
smaller suppression. 

Figure |H|b shows the results from the virtuality ordered shower and there are clear 
problems. Each CKKW curve has a dip right at the merging scale and for higher values 
of ys they are significantly above the default results from PYTHIA. One source of problems 
is that the shower can generate unordered emissions, since an emission which has a high 
vitruality can have a low k±. This also changes which Sudakov form factor is applied to 
each region of phase space. A derivation of the problems that this can lead to for the lowest 
order process is presented in appendix [A|. 

To further scrutinize the causes of the problems in the virtuality ordered CKKW 
implementation, we have studied the variable used for the shower veto which is the k±- 
value of the individual emission. Figure ^ shows the maximum Durham A;j_-value for the 
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Figure 5: The t/3 spectra at parton level for PYTHIA with pj_-ordered (a) and virtuality ordered 
(b) shower (and with the first order matrix element reweighting included) and for our corresponding 
PYTHIA implementations of CKKW corrections with different merging scales. 
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Figure 6: The maximum Durham k±-value of any emission in the cascade for PYTHIA with p±- 
ordered shower (and with the first order matrix element reweighting included) and for the two- and 
three-jet components in our PYTHIA implementation of CKKW corrections with a merging scale 
Ums = 10~ 2 . An extra curve is included to show the two-jet contribution without the suppression 
from the analytic Sudakov form factor. 

emissions in the shower. The results have been split into two and three-jet components of 
the CKKW implementation using PYTHIA p± and virtuality ordered shower. 

The two-jet events in CKKW is produced by invoking the shower vetoing emissions 
above the merging scale and reweighting with an analytical Sudakov form factor. These 
two steps can be considered separately, which is illustrated in figure |6| by having a curve 
that only includes the veto and not the reweighting. 

For the shower ordered in transverse momentum the three-jet contribution match the 
curve from running default PYTHIA fairly well above the merging scale. This is expected 
since the ordering in the transverse momentum ordered shower is fairly close to using 
Durham /cj_-ordering. The two-jet contribution is similar in shape but is higher in cross 
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section, which can be attributed to a smaller suppression from the analytical Sudakov form 
factors as compared to the form factors used in the shower. There is a discontinuity at 
the merging scale, but it is smoothed out by the shower causing the figure |5|a to appear 
somewhat smooth. 

With the virtuality ordered shower the results are quite different, which is caused 
by using two rather different scales. The problem is that emissions modifies the phase 
space for subsequent emissions, and by emitting the partons with highest virtuality first, 
the hardest emissions in k± are no longer allowed. When studying the hardest emission 
according to Durham k±, this results in a shift to smaller values. This is why the PYTHIA 
curve is significantly below the three-jet CKKW curve in figure ^o. The sum of the two 
and three-jet contributions in figure ||b has two peaks with a clear dip in between. When 
the rest of the emissions are included this structure is somewhat smothed out, but the two 
peaks with a dip in between are clearly visible in figure ^p. 

It is clear that one has to be a bit careful regarding the choice of shower and scales 
in CKKW. One CKKW implementation based on HERWIG was published in [3] and the 
results were consistent only after a significant amount of tuning of scale parameters. A 
similar procedure could probably be used to make our results more consistent, but it would 
add extra somewhat arbitrary parameters to the model. The problem with the ordering 
of the emissions also leads to a different colour structure in the events, which was pointed 
out in [26]. 

Finally we show the consequences for two experimental observables in figure 0. Again, 
the results of the p_|_-ordered shower has only been compared to the default PYTHIA, since 
it is not yet tuned to data. The pj_-ordered plots show significant lower values at high 
and low thrust, which is the result of the excess in cross section for two-jet events. We also 
see some trace of the discontinuities in figure ||a, but they have been smoothed out by the 
shower and hadronization. The results from the virtuality ordred shower shows the same 
dips in the 2/3 distribtion as in the parton-level plots. The dips are not visible in the thrust 
plot, but there are significant deviations from default PYTHIA. 

3.3 Pseudo-Shower 

We have implemented the Pseudo-Shower [3] algorithm using matrix element events gen- 
erated with MADEVENT and using PYTHIA with both virtuality ordered and pj_-ordered 
parton shower. The implementation has been done with the following definition for the jet 
clustering. 

dij = mfjEiEj/iEi + Ej) 2 (3.3) 

This is equivalent to the measure in the LUCLUS algorithm 11 (and to the fcj_-definition in 
eq. ( |3.2D ) in the limit of massless particles. The jet algorithm was used both to define the 
merging scale and the clustering of jet observables as specified by the merging scheme. a s - 
reweighting has been introduced using the scales from the jet clustering, which is happens 
to be the same scale definition that is used in the a s evaluation in the PYTHIA shower. 

"Originally included in the JETSET program [44], now a part of PYTHIA as the PYCLUS routine. 
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Figure 7: Charged plus neutral particle 1/3 and charged particles thrust for our PYTHIA implemen- 
tation of CKKW corrections with transverse momentum and virtuality ordered showers and using 
different merging scales. Figure (c) includes a comparison to ALEPH data and (d) a comparison to 
DELPHI data, where figure (a) and (b) only compare to the default PYTHIA, since the transverse 
momentum ordered shower has not yet been tuned to data. 



The first thing that is investigated is the effects of different values of the fudge factor 
5 introduced in section 2.4. The most sensitive distribution for checking the effects is the 
same variable as the merging scale. Figure [||a and ||c show the (I3 distribution at parton 
level for three different values of 8. In both figures the curve that shows the smallest 
deviations is 5 = 1 GeV, and this value is used in the rest of this section. It is clear that 
there is no smooth transition between the matrix element and parton shower phase space. 

Figure |8| also showes what happens if the merging scale is varied. The discontinuities 
persist for all three values of the merging scale and for the two different showers and the 
results are heavily dependent on the merging scale. The problem is a conseqence of using 



different ways of defining the scales in the algorithm, which we discussed in section 2.4 



The results in the rest of this section focuses on the Pseudo-Shower implemenation with a 
vituality ordered shower, since this was used in the original publication [3]. 

To demonstrate the different scales used in the algorihtm, the Pseudo-Shower results 
have been split up into the contributions from the two- and three-jet matrix elements. 
Figure Pa shows the d% distribution, where it is clear that the two-jet contribution displays 
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Figure 8: The c?3 spectra at parton level for PYTHIA (with first order matrix element reweighting 
included) and for our implementation of the Pseudo-Shower algorithm using both the vituality 
ordered and the p_L-ordered shower, (a) and (c) show the effects of different valuses of the fudge 
factor 5 with a merging scale of V^ms = 5 GeV. (b) and (d) show the results from different values 
of the merging scale c?ms using a fudge factor 6 = 1 GeV. 



a sharp cut, but not the three-jet component. On the other hand, when the largest d-value 
of the emissions in the shower is plotted in figure ^b, the three-jet component has a sharp 
cut but not the two-jet curve. This illustrates that different scales are used for the different 
jet multiplicities and this is the reason for the problems that appear close to the merging 
scale. 

To further illustrate the complication with mixing scales for parton splittings and 
partonic jets, we show in figure the variation of the ratio between the scale on 
partonic jet level and the largest generated scale in the parton shower d > . In a strongly 
ordered shower this ratio should ideally be unity, especially for the p^-ordered PYTHIA 
shower in figure ICa. However, we find that this is far from the case. Especially for the 
virtuality ordered shower in figure [Tc|b, the correlation between the different scales is very 
weak. 

Finally two experimental observables are plotted, namely the normalized 7/3 distribu- 
tion for charged and neutral particles and the charged particles thrust, which are shown in 
figure [ll|. Hadronization smoothes out most of the discontinuities shown in earlier figures, 
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Figure 9: The Pseudo-Shower results, with V^ms = 5 GeV and S — 1 GeV, split into two- and 
three-jet components compared to the default shower in PYTHIA (with first order matrix element 
reweighting included), (a) shows the d% distribution at parton level and (b) is the largest d- value 
of the emissions in the shower. 




Figure 10: The average ratio between the clustering scale for partonic jets and the generated 
maximum splitting scale P3 in the cascade as a function of the latter. The shaded area indicates 
the standard deviation of the ratio. Both scales are defined as in eq. (^). (a) is for the pj_-ordered 
shower in PYTHIA, while (b) is for the virtuality ordered one. 

but it is clear that there are still problems. The plots show deviations from data and the 
results are not be independent of the merging scale. 

3.4 MLM 

To test MLM for e + e~ a merging scheme and a parton-jet distance need to be chosen. 
Most of the MLM implementations use cone algorithms to achieve this purpose. In the 
case of e + e~ , we think that a /cj_-based clustering algorithm is a better choice. We have 
therefore decided to use the Durham k± algorithm to define the merging scale and matrix 
element cutoff. 

When it comes to selecting a measure for the parton-jet distance, the natural choice 
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Figure 11: The charged plus neutral particle spectra (a) and charged particle thrust spectra 
(b) for PYTHIA (with first order matrix element reweighting included) and for our implementation 
of the Pseudo-Shower algorithms with different merging scales and 8 = 1 GeV compared to ALEPH 
and DELPHI data. 

would be the /cj_-distance between the jet and the parton. The problem with this approach 
is that since the &j_-measure includes the minimum of the two energies it means that soft 
partons can match jets at very wide angles, which leads to problems with convergence. 
Instead we modified the distance to only use the jet energy. 

2/jet,parton = 2-Ej et (l — COs(#j e t,parton))/-EcM (3.4) 

For the highest multiplicity treatment we follow the MADEVENT implementation in 
[11]. For each highest multiplicity event, jets are reconstructed at a scale which is the 
maximum of the merging scale and the smallest distance between the partons in the matrix 
element level event, max(?/MSi Un)- This scale is also used when matching the jets to 
partons. This allows extra jets to be produced if they are softer than those from the 
matrix element. 

The algorithm has been implemented using MADEVENT to generate the matrix element 
event and both HERWIG and PYTHIA has been used to shower the events. Before we move on 
to the results of the merging, some important aspects of the showers need to be explained. 

Step H in the algorithm (described in section [2.5| ), where the shower is invoked, depends 
on how the state received from the matrix element is treated in the parton shower program. 
For the analysis of the merging it is important to understand how the events are treated 
internally. In particular the states containing soft and collinear partons require extra 
scrutiny since the matrix element cross section is divergent in these regions. 

In the PYTHIA implementation of the Les Houches interface, the limiting factors for 
the radiation are the scale from the Les Houches interface, which is used as a veto, and 
the energy of the partons, which become the maximum kinematically allowed value for the 
invariant mass. No extra vetoes are applied for emissions at a wide angles. This means 
that even when soft or collinear partons are present some events have a lot of emissions, 
assuming that the scale in the Les Houches interface is set to a high value. Figure [T^ a 
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Figure 12: The y% spectra at parton level for PYTHIA (a) and HERWIG (b) when starting from 
a state with a soft or a collinear gluon with a merging scale of y = 1CP 3 and the scale in the Les 
Houches interface set to Eqm- 



shows the 1/3 spectra for a state with a soft gluon and for one with a collinear gluon with 
2/3 = 1CP 3 and the scale set to Eqm- It is obvious from the figure that a soft or collinear 
parton does not limit the emissions significantly. 

HERWIG uses a different strategy for the internal choices of scale. Each parton is given 
a starting scale which is equal to the product of the four-momentum of itself and its color 
neighbour (if a parton has two color neighbours one is chosen at random). The parton 
is then boosted to the frame where it is at a right angle compared to its color neighbour 
and the shower invoked. Any scale set in the Les Houches interface is included as a veto 
on the transverse momentum (approximately given by eq. ( |3.3D ) of the emissions. The 1/3 
histogram from a HERWIG shower are shown in figure |i~2| b starting from the same states 
and from the same scale as in figure |l~2|a. From the figure one can tell that a soft parton 
limits the emission from the shower quite drastically, but for a collinear parton about half 
of the events show significant jet activity. 

Clearly, the way soft and collinear events from the matrix element give rise to hard 
emissions influences the jet matching veto, which is assumed to give the Sudakov sup- 
pression of these events, and the different choices of scales in the shower can have a big 
impact on these Sudakov form factors. This means that one has to be very careful with 
generalizing the MLM algorithm to different showers. 

The first thing that needs to be verified with our MLM implementation is if the results 
converge when the matrix element cutoff is lowered. The results of the merging are shown 
in figure 13 for PYTHIA and HERWIG, with a merging scale of ys = 10~ 2 and three different 
values for the cutoff. No sign of convergence is visible in the figure. 

The problem is related to how HERWIG and PYTHIA treats input with soft or collinear 
partons. The MLM algorithm assumes that soft and collinear partons cannot produce 
independent jets when fed into the shower. From the earlier figures it is clear that PYTHIA 
can produce extra jets both for soft and collinear partons, whereas HERWIG can only produce 
extra jets for collinear partons. This is the reason why PYTHIA diverges faster. 
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Figure 13: (a) The 2/3 spectra at parton level for default PYTHIA and our PYTHIA implementation 
of the MLM algorithm for i/ms = 10~ 2 and for different values of the matrix element cutoff, (b) 
shows the same for HERWIG. 



The changes to the parton-level spectra is also visible in the hadron-level observables, 
shown in figure It is clear that changing the matrix element cutoff also affects the event 
shape, which diminishes the predictive power of the model. 

In figure 15 the parton-level y3-spectra is shown with a lower merging scale. Generally 
one can say that the discrepancies as compared to the standard PYTHIA and HERWIG become 
even bigger. In HERWIG, configurations with soft gluons cannot generate hard emissions, 
which means a large fraction of these events are kept. There is a pole in the soft gluon 
limit and the cross sections for these events are therefore rather large, which leads to a 
continued rise of the cross section of the MLM-corrected curve for low values of 1/3 , clearly 
visible in figure |l5|b. 

The examples above show that the MLM algorithm is very sensitive to how emission 
are generated and it appears to be quite tough to fix this problem. If the shower generates 
an emission it risks influencing things at higher jet scale and if emissions are suppressed 
it is not possible to control the rise of the cross section near the soft and collinear limits. 
In fact, even if the problems described in this section can somehow be resolved, the more 
fundamental problem, described in section 2.5, that MLM uses the wrong type of Sudakov 
form factors remains unsolved. 



4. Conclusions 

In the previous sections, we have studied in some detail the behavior of four suggested 
algorithms, or schemes, for merging fixed order, tree-level matrix element generators with 
parton shower generators. We have done so by considering the simplest possible case of the 
leading order correction to the e + e~ — * hadrons process. This may not be an important 
use case for these merging schemes, but since it is such a simple process it is fairly easy 
to check whether the schemes actually accomplishes what they set out to do, namely to 
correctly populate the phase space above the merging scale with partons described by 
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Figure 14: The charged plus neutral particle 2/3 spectra (a) and the charged particle thrust spectra 
(b) for default PYTHIA and our PYTHIA implementation of the MLM algorithm for three different 
values of the matrix element cutoff. The HERWIG results are shown in (c) and (d) respectively. 

the full matrix element, and with emission from the parton shower below, and that they 
do so while correctly resumming large logarithmic contributions from soft and collinear 
divergencies. In addition, for this process we also have the "correct" answer available, 
obtained by a simple reweighting of the hardest emission in the parton shower. 

Our main finding is that of the four schemes considered, only CKKW and CKKW-L 
meets the requirements and that even CKKW has some problems when using a parton 
shower with an ordering variable which is very different from the clustering variable used 
for the merging scale. 

For CKKW-L it was claimed in [2] that the cancellation of the dependence on the 
merging scale is complete when using the ARIADNE dipole shower. Here we show this 
explicitly and also show that it is still true when using the transverse momentum ordered 
shower in PYTHIA. This is not a completely trivial result, because it involves an additional 
ambiguity in how to reconstruct the shower history for PYTHIA which is not present in 
Ariadne. 

In the CKKW case we also find a near complete cancellation of the merging scale 
dependence when using the p^-ordered PYTHIA shower. However, when using the virtuality 
ordered shower in PYTHIA, we find a clear mismatch near the merging scale, and we trace 
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Figure 15: (a) The spectra at parton level for default PYTHIA and our PYTHIA implementation 
of the MLM algorithm for t/ms = 10~ 3 and for two different values of the matrix element cutoff, 
(b) shows the same for HERWIG. 

this mismatch to different treatments of the no-emission probability from the gluon when 
it is emitted above and below the merging scale, and from possible problems in the parton 
shower when unordered emissions are allowed. These problems occur when the ordering 
variable is very different from the Durham /cj_-measure used for the merging scale, which 
is true for the virtuality ordering, but not for the pj_-ordered shower. We expect that this 
is also the origin of the mismatches found in the CKKW implementation in HERWIG [3], 
where an elaborate tuning of the scales used in the Sudakov and ce s -reweighting was needed 
to minimize the merging scale dependence. Most likely a similar retuning can be done in 
the case of the virtuality ordered shower in PYTHIA. 

For Mrennas Pseudo-Shower scheme, we find that the problems are much more severe. 
The main problem is that cuts made on parton level for the matrix element are made on 
the partonic jet level for the parton shower, and we show that this can never give the clean 
separation of phase spaces needed to have independence on the merging scale. This leads to 
a severe underestimate of the three-jet rate just above the merging scale. The fudge factor 
introduced in [3] can be tuned to hide the problem, but at the expense of overestimating 
the three-jet rate below the merging scale due to double-counting. 

The MLM scheme is the simplest one to implement, in that it allows the use of any 
parton shower generator without modifying its internal behavior. It also uses partonic jet 
level cuts, but contrary to the Pseudo-Shower it tries to avoid mixing them with cuts on the 
few parton level. Nevertheless, an extra parton-level generation cut is needed for the matrix 
element. Supposedly the results should be independent on this generation cut as long as 
it is sufficiently smaller than the merging scale. However, we have found that this is not 
the case, and that the result is very sensitive to how the kinematics of the initial parton 
state limits emissions from the chosen parton shower. If the emissions are not limited, 
events generated close to the generation cut will always have a finite possibility to end 
up above the merging scale, which makes the scheme very sensitive to the generation cut. 
If, on the other hand, the emissions are limited by the kinematics, there is no possibility 
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to obtain the necessary Sudakov suppression of soft and collinear divergencies. It is not 
inconceivable that the generation cut can be tuned, possibly together with the scale used 
in the a s -weighting, to get reasonably smooth distributions, but we feel that this is just 
hiding the fact that the MLM scheme has serious flaws. 

Having done this investigation of the behavior of the algorithms for the simplest possi- 
ble process, the question is if we can make some conclusions for more complicated processes. 
Or course, if a scheme does not handle this simple process well, it is not likely that the 
situation will improve for more complicated ones. But also in the case of CKKW-L, com- 
plications may occur. 

As we go to higher parton multiplicities in the matrix element, we expect that the 
actual matrix element correction becomes larger, necessarily giving rise to discontinuities 
near the merging scale. These discontinuities should disappear as the merging scale is 
lowered and the parton shower splittings become a good approximation to the full matrix 
element. However, for this to work there must not be any artificial dependencies on the 
merging scale in the merging algorithm as such, which we have seen is not the case for all 
schemes. 

The really interesting processes are in hadron collisions, where e.g. the standard model 
production of a W together with several hard jets is an important background to almost 
any search for new phenomena. Here the matching is complicated as the parton showers 
also includes initial state evolution of the incoming partons. Also, care must be taken to 
treat the parton densities in a consistent way. This was first investigated for the CKKW 
scheme in [5], and later for ARIADNE implementation of CKKW-L in [12]. For the latter 
it was shown that the merging scale dependence does indeed cancel for the leading order 
correction to H^-production. Although it has not been checked explicitly for the other 
schemes, there is no reason to expect that the problems we have found in this paper will 
go away. 

Having said all this, we must ask ourselves how severe the deficiencies we have found 
here would be for practical applications at e.g. the LHC. We are, after all, dealing with 
tree-level matrix elements and leading log parton showers which means we will in any 
case expect large scale dependencies. And surely these deficiencies results in much smaller 
uncertainties as compared to using parton showers without matrix element corrections. 
Indeed, it was shown in [11] that CKKW, CKKW-L and different MLM implementations 
all give consistent results for realistic experimental observables within reasonable variations 
of the scales used in a s and parton densities. In fact the most severe disagreement found 
was for the ARIADNE implementation of CKKW-L. However, this had nothing to do with 
the merging scheme, but is a consequence of the radically different treatment of the phase 
space available for initial-state radiation in ARIADNE as such, which includes a resummation 
of some logarithms of x, not present in conventional initial-state showers. 

In [1] a large effort was put into showing that the dependence of the merging scale 
vanishes at next-to-leading logarithmic accuracy for CKKW, at least for a shower which is 
ordered in the Durham k± variable. Here we have shown that for the three-jet observables 
considered here, the dependence vanishes completely for CKKW-L. For the Pseudo-shower, 
the situation is more difficult to analyze. In the strongly ordered limit we argue that it is 
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equivalent to CKKW-L, which indicates that the dependence on the merging scale should 
cancel, at least to leading double-logarithmic accuracy, nevertheless the dependence is 
quite large in absolute numbers. For MLM, it is clear that for the observables considered 
here, it is questionable if it can reproduce the correct Sudakov form factors, indicating a 
dependence on the merging scale (or rather the cutoff in the matrix element generation) 
already on the leading double-logarithmic level. We plan to return in a future publication 
with a more formal investigation of the logarithmic accuracy of the merging scales. 

In absolute numbers, we have seen that the Pseudo-Shower and the MLM schemes, and 
even CKKW for some parton showers, have problems with correctly populating different 
regions of phase space. And even if this can be smoothed out by introducing additional 
cuts on the matrix element generation (for MLM), a fudge factor (for Pseudo-Shower) or 
separately varying the scale in the a s and Sudakov reweighting (for CKKW with HERWIG 
[3]) it necessarily means introducing extra parameters which need to be tuned. Indeed, it 
is not inconceivable that these extra parameters need to be tuned differently for different 
processes and merging scales, and maybe even for different observables. As a consequence, 
the predictability of the models will be reduced. 
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A. Different scale definitions in CKKW 



To illustrate the origin of the discontinuities found for CKKW when used with the 
virtuality-ordered shower in PYTHIA, we here look at the somewhat artificial example of 
calculating the differential exclusive three-jet rate, where we use the notation introduced 



in section 2.2. 



Assume the parton shower has ordering in p, and has auxiliary splitting variables, x, 
and that the Durham fcj_-measure can be written as a function of these variables, y = 
y(p,x). Now we can write the differential exclusive three-jet rate above some po for the 
plain parton shower case: 

drr PS 

= A™(p max ,p)T p s s 2 (p,x)A™(p,p )dpd k x (A.l) 

where As 2 (p max , p) is the Sudakov corresponding to the no-emission probability from the 
two-parton state above p, and A$ 3 (p, po) is the no-emission probability from the three- 
parton state below p. We can also write the corresponding rate for the CKKW scheme: 

^ ra A™( ymax , y )F^ (/) ,f) x 

CO 

A p s l(p max ,p; < y)A^(p,p ; < y)dpd k x (A.2) 
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where A^ KW (y max , y) is the analytic Sudakov form factor used for reweighting, and 
Ag^(p, po; < y) is the parton shower no-emission probability from the three-parton state 
between p and po excluding phase space region corresponding to the vetoed emissions 
with Durham k± above y(p,x). A^(p max ,p; < y) is the corresponding no-emission prob- 
ability for the two-parton state, which approximates the no-emission probability for the 
three-parton state above the scale p where the gluon is not allowed to radiate — an ap- 
proximation which is valid as long as the gluon is not too hard. Now the CKKW Sudakov 
can be approximately written as the no-emission probability 

Ag 2 KKW (y max , y) ~ A| 2 S (p max , p ; > y) = Ag 2 S (p max , p; > y)A F s *(p, p ; > y), (A.3) 

where the > y notation indicates that only emissions with Durham above y are consid- 
ered. Also, we can write 

A£ 2 S (/w, p) = A| 2 s (p max , p; > y)A^ 2 s (p max , p; < y) (A.4) 

and 

As 3 S (p,Po) = A p s f(p,p ; > y)A p s f(p,p ; < y) (A.5) 
We can now rewrite the two three-jet rates as 

= T^{p, x)A| 2 S (p max , p; > y)A| 2 S (p max , p; < y) x 



0o 



As 3 S (P,Po; > y)A p s *(p,p ; < y)dpd h 



x 



^CKKW 

= rf*(p,x)A p s l(p max ,p;> y)A p s l(p,p ;> y) x 



0"o 



A| 2 s (p max ,/9; < y)A^(p,p ; < y)dpd k x (A.6) 



And we find that the only difference, besides the desired T p ^ — ► r^ E , is that in the plain 
shower has a no-emission probability A<^(p, po;> y) where CKKW has A p ^(p, po;> y). 
This can be seen as an extra suppression in the plain parton shower due to the no-emission 
probability from the gluon at a shower scale below p but at a Durham A;j_-scale above y. 
Now if the ordering variable p is close to the Durham k±, this region will be very small 
and the mismatch will be small, as we saw in the case of the pj_-ordered PYTHIA shower. 
But for the virtuality ordering in PYTHIA we have 



12 



1 1^-^)4 (A - 7) 



and similarly for the angular ordering variable, £, in HERWIG we have 

y = min(z 2 ,(l-z) 2 )-^-, (Ai 



CM 



(where E is the energy of the parent parton in the splitting) and, hence, the region can 
become quite large, especially in the regions z — ► or 1, where the probability of an 
emission is large. We note, however, that in PYTHIA the effect is limited by the presence 
of an additional angular ordering veto in addition to the virtuality ordering. 
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